MEMO: multi-experiment mixture model analysis of censored data
Abstract
MOTIVATION: The statistical analysis of single-cell data is a challenge in cell biological studies. Tailored statistical models and computational methods are required to resolve the subpopulation structure, i.e. to correctly identify and characterize subpopulations. These approaches also help to unravel sources of cell-to-cell variability. Finite mixture models have shown promise, but the available approaches are ill suited to the simultaneous consideration of data from multiple experimental conditions and to censored data. The prevalence and relevance of single-cell data, together with the lack of suitable computational analytics, call for automated methods that can meet the requirements posed by these data.

RESULTS: We present MEMO, a flexible mixture modeling framework that enables the simultaneous, automated analysis of censored and uncensored data acquired under multiple experimental conditions. MEMO is based on maximum-likelihood inference and allows for testing competing hypotheses. MEMO can be applied to a variety of single-cell data types. We demonstrate the advantages of MEMO by analyzing right- and interval-censored single-cell microscopy data. Our results show that accounting for censoring and considering different experimental conditions simultaneously are both necessary to reveal biologically meaningful subpopulation structures. MEMO allows for a stringent analysis of single-cell data and enables researchers to avoid misinterpreting censored data. MEMO is therefore a valuable asset for all fields that infer the characteristics of populations from single individuals, such as cell biology and medicine.

AVAILABILITY AND IMPLEMENTATION: MEMO is implemented in MATLAB and freely available via GitHub (https://github.com/MEMO-toolbox/MEMO).

CONTACTS: [email protected] or [email protected]

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
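The maximum-likelihood treatment of censored data mentioned in the abstract can be illustrated with a minimal sketch. This is not MEMO's code (MEMO is a MATLAB toolbox); it is a hypothetical Python example that assumes normal mixture components and encodes each observation as a `(lower, upper)` pair. The function names `norm_pdf`, `norm_cdf` and `mixture_log_likelihood` are invented for illustration. An exactly observed value contributes the mixture density, while a censored value contributes the mixture probability of falling inside its interval.

```python
import math


def norm_pdf(x, mu, sigma):
    """Density of a normal distribution at x."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def norm_cdf(x, mu, sigma):
    """Cumulative distribution function of a normal distribution at x."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def mixture_log_likelihood(observations, weights, mus, sigmas):
    """Log-likelihood of a normal mixture for possibly censored data.

    Each observation is a (lower, upper) pair (an assumed encoding):
      lower == upper        -> exactly observed value
      upper == math.inf     -> right-censored at `lower`
      lower <  upper        -> interval-censored in [lower, upper]
    """
    components = list(zip(weights, mus, sigmas))
    total = 0.0
    for lower, upper in observations:
        if lower == upper:
            # Uncensored: use the mixture density.
            contrib = sum(w * norm_pdf(lower, m, s) for w, m, s in components)
        else:
            # Censored: use the mixture probability of the interval.
            contrib = sum(
                w * (norm_cdf(upper, m, s) - norm_cdf(lower, m, s))
                for w, m, s in components
            )
        total += math.log(contrib)
    return total


if __name__ == "__main__":
    data = [(1.2, 1.2), (0.4, 0.4), (3.0, math.inf), (2.0, 2.5)]
    print(mixture_log_likelihood(data, [0.6, 0.4], [0.5, 2.5], [0.5, 0.8]))
```

In a multi-experiment setting, one plausible extension is to sum such log-likelihoods over experimental conditions while sharing some parameters between them, and then to maximize the total with a numerical optimizer. How MEMO itself parameterizes and optimizes the model is not stated in the abstract.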
Related resources
Kernel Ridge Estimator for the Partially Linear Model under Right-Censored Data
Objective: This paper aims to introduce a modified kernel-type ridge estimator for partially linear models under randomly right-censored data. Such models raise two main issues that need to be solved: multicollinearity and censorship. To address these issues, we improved the kernel estimator based on synthetic data transformation and kNN imputation techniques. The key idea of this paper is t...
Bayesian Analysis of Censored Spatial Data Based on a Non-Gaussian Model
Abstract: In this paper, we suggest using a skew Gaussian-log Gaussian model for the analysis of spatial censored data from a Bayesian point of view. This approach extends the skew log Gaussian model to accommodate skewness, heavy tails and censored data, three pervasive features of spatial data. We utilize data augme...
Analysis of Hybrid Censored Data from the Lognormal Distribution
The mixture of Type I and Type II censoring schemes is called hybrid censoring. This article presents statistical inference on the lognormal parameters when the data are hybrid censored. We obtain the maximum likelihood estimators (MLEs) and the approximate maximum likelihood estimators (AMLEs) of the unknown parameters. Asymptotic distributions of the maximum likelihood estimators are used ...
Bayesian analysis of doubly censored lifetime data using two-component mixture of Weibull distribution
In recent years, the analysis of mixture models under the Bayesian framework has received considerable attention. However, the Bayesian estimation of mixture models under doubly censored samples has not yet been reported. This paper proposes a Bayesian estimation procedure for analyzing lifetime data under doubly censored sampling when the failure times belong to a two-component mixture of the W...
EM algorithms for multivariate Gaussian mixture models with truncated and censored data
We present expectation-maximization (EM) algorithms for fitting multivariate Gaussian mixture models to data that are truncated, censored, or both truncated and censored. These two types of incomplete measurements are naturally handled together through their relation to the multivariate truncated Gaussian distribution. We illustrate our algorithms on synthetic and flow cytometry data.